PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information
TF ID GSMUA_Achr8P27340_001
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; commelinids; Zingiberales; Musaceae; Musa
Family HD-ZIP
Protein Properties Length: 749 aa    MW: 83854.2 Da    pI: 7.0612
Description HD-ZIP family protein
Gene Model
Gene Model ID          Type    Source
GSMUA_Achr8P27340_001  genome  CIRAD
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    Homeobox  67.6   1.6e-21  91     146  1          56
                            TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
               Homeobox   1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                            r+k +++t+eq++e+e+lF+++++p++++r++L+++lgL+ rqVk+WFqNrR++ k
  GSMUA_Achr8P27340_001  91 RKKYHRHTAEQIREMEALFKESPHPDEKQRQQLSNQLGLSARQVKFWFQNRRTQIK 146
                            7999************************************************9877 PP

2    START     170.4  1.2e-53  265    484  6          206
                            HHHHHHHHHHHC-TT-EEEE....EXCCTTEEEEEEESSS......SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S....EEE CS
                  START   6 aaqelvkkalaeepgWvkss....esengdevlqkfeeskv.....dsgealrasgvvdmvlallveellddkeqWdetla....kae 80 
                            a++el k+a+a+ep+Wv+s+    e++n+de++++f+++ +     +++ea+r++g+v+ ++++lv+ ++d++ qW+e ++    ka 
  GSMUA_Achr8P27340_001 265 ALEELTKMATAQEPLWVRSVetgrEILNYDEYVKEFSPDMSrngcvRNIEASRETGIVFFDMPRLVQAFMDVN-QWKEFFPclisKAV 351
                            789***********************************98899******************************.************** PP

                            EEEEECTT......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEEEEEEEECTCE CS
                  START  81 tlevissg......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppesssvvRaellpSgiliepksngh 161
                            +++ is+g      g++qlm+ae q+l+plvp R+ +fvRy+++l   +w+i+d+S+d  +++  ++s+++++++pSg++ie+ + gh
  GSMUA_Achr8P27340_001 352 IVDIISKGlgdskdGTIQLMFAEIQMLTPLVPtREIYFVRYCKKLCPTRWAILDISIDKLEENI-DASLMKCRKRPSGCIIEDQDTGH 438
                            **************************************************************98.9********************** PP

                            EEEEEEE-EE--SSXX.HHHHHHHHHHHHHHHHHHHHHHTXXXXXX CS
                  START 162 skvtwvehvdlkgrlp.hwllrslvksglaegaktwvatlqrqcek 206
                            +kvt ++      +   + l++++v sgla+ga++w+atl+ qce+
  GSMUA_Achr8P27340_001 439 CKVTQLSYSCFPLHCGiPTLYHPIVTSGLAFGARHWMATLRLQCER 484
                            ****987655444444469*************************97 PP
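
The Homeobox alignment above can be summarised numerically. The sketch below is illustrative only: it assumes the usual HMMER-style reading of the middle row, in which a letter marks a residue identical to the HMM consensus and `+` marks a conservative substitution. The three rows are copied from this record; the variable names are made up for the example.

```python
# The three Homeobox alignment rows from this record, copied verbatim.
consensus = "rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek"
midline   = "r+k +++t+eq++e+e+lF+++++p++++r++L+++lgL+ rqVk+WFqNrR++ k"
target    = "RKKYHRHTAEQIREMEALFKESPHPDEKQRQQLSNQLGLSARQVKFWFQNRRTQIK"

# This alignment has no gap columns, so all three rows have equal length.
assert len(consensus) == len(midline) == len(target)

# Assumption: letters in the middle row mark identities, '+' marks
# conservative substitutions, and a blank marks a mismatch.
identical = sum(ch.isalpha() for ch in midline)
similar = midline.count("+")

# Cross-check: count identities directly, ignoring letter case.
direct = sum(c.upper() == t.upper() for c, t in zip(consensus, target))

aligned = len(target)
print(f"aligned columns: {aligned}")
print(f"identical: {identical} ({100 * identical / aligned:.1f}%)")
print(f"similar (+): {similar}")
print(f"identities by direct comparison: {direct}")
```

For this alignment both identity counts come to 26 of 56 columns. The START alignment contains gap columns (`.` in the consensus row), so the same approach would need those columns handled separately.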

Protein Features
3D Structure
Database         Entry ID          E-value   Start  End  InterPro ID  Description
Gene3D           G3DSA:1.10.10.60  5.6E-21   75     142  IPR009057    Homeodomain-like
SuperFamily      SSF46689          1.84E-19  84     149  IPR009057    Homeodomain-like
PROSITE profile  PS50071           18.204    88     148  IPR001356    Homeobox domain
SMART            SM00389           7.8E-19   90     152  IPR001356    Homeobox domain
Pfam             PF00046           6.1E-19   91     146  IPR001356    Homeobox domain
CDD              cd00086           5.36E-18  91     146  No hit       No description
PROSITE pattern  PS00027           0         123    146  IPR017970    Homeobox, conserved site
PROSITE profile  PS50848           32.886    251    487  IPR002913    START domain
SuperFamily      SSF55961          1.92E-26  255    484  No hit       No description
CDD              cd08875           2.65E-91  257    483  No hit       No description
SMART            SM00234           2.5E-50   260    484  IPR002913    START domain
Pfam             PF01852           1.4E-42   264    484  IPR002913    START domain
SuperFamily      SSF55961          6.18E-12  527    715  No hit       No description
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
GO:0008289  Molecular Function  lipid binding
GO:0043565  Molecular Function  sequence-specific DNA binding
Sequence
Protein Sequence    Length: 749 aa
MHQSSMNNNN TPVKDFFASP ALSLSLAGVF RNNAAAVVDV EEGDEASKGG CQREQAEISG  60
ENSGPAGRSD EDRESNESQE ENREVGNNRK RKKYHRHTAE QIREMEALFK ESPHPDEKQR  120
QQLSNQLGLS ARQVKFWFQN RRTQIKAVQE RHENSLRKSE IEKLQEENRT MREKIKKGCC  180
PNCGYTTLSN GTTITTEEQQ HHIENTRLKA EIKKLRRMLG SIPDGNTSPS SSCSAGADQN  240
KSSLDSCSRF LGPEKFRILE IVNVALEELT KMATAQEPLW VRSVETGREI LNYDEYVKEF  300
SPDMSRNGCV RNIEASRETG IVFFDMPRLV QAFMDVNQWK EFFPCLISKA VIVDIISKGL  360
GDSKDGTIQL MFAEIQMLTP LVPTREIYFV RYCKKLCPTR WAILDISIDK LEENIDASLM  420
KCRKRPSGCI IEDQDTGHCK VTQLSYSCFP LHCGIPTLYH PIVTSGLAFG ARHWMATLRL  480
QCERSVFFMA TNVPTRDCNG VSTLAGRKSI LKLGQRMTSC FCQNIGASGH HKWTKVSTKG  540
GDEIRFTSRK NINDPGEPLG LIICSVLSTW LPVPAMSLFN FLRDDSRRTE WDIMLTPSPT  600
QTMVNLVKGQ DRGNSVTIYS LQTTTSSERT NIWVLQDCST NSYESMVVFA PVEIDGTQSV  660
MNGCDSSSLA ILPSGFSILP DGLETRPLVI TSRPQERTME GGSLLTVAFQ ILADASPVAR  720
PTTESVETIN TLVSCTLQNI KKALQCEDG
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    88     92   RKRKK
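
The coordinates reported in this record can be checked against the protein sequence. The following is a minimal sketch, not part of PlantTFDB: the helper names `parse_sequence_block` and `region` are hypothetical, and the sketch assumes the signature domain Start/End values are 1-based and inclusive, which is how the alignment rows are numbered.

```python
import re

# Sequence block as displayed in this record: groups of 10 residues,
# with a running residue count at the end of each full line.
SEQUENCE_BLOCK = """
MHQSSMNNNN TPVKDFFASP ALSLSLAGVF RNNAAAVVDV EEGDEASKGG CQREQAEISG  60
ENSGPAGRSD EDRESNESQE ENREVGNNRK RKKYHRHTAE QIREMEALFK ESPHPDEKQR  120
QQLSNQLGLS ARQVKFWFQN RRTQIKAVQE RHENSLRKSE IEKLQEENRT MREKIKKGCC  180
PNCGYTTLSN GTTITTEEQQ HHIENTRLKA EIKKLRRMLG SIPDGNTSPS SSCSAGADQN  240
KSSLDSCSRF LGPEKFRILE IVNVALEELT KMATAQEPLW VRSVETGREI LNYDEYVKEF  300
SPDMSRNGCV RNIEASRETG IVFFDMPRLV QAFMDVNQWK EFFPCLISKA VIVDIISKGL  360
GDSKDGTIQL MFAEIQMLTP LVPTREIYFV RYCKKLCPTR WAILDISIDK LEENIDASLM  420
KCRKRPSGCI IEDQDTGHCK VTQLSYSCFP LHCGIPTLYH PIVTSGLAFG ARHWMATLRL  480
QCERSVFFMA TNVPTRDCNG VSTLAGRKSI LKLGQRMTSC FCQNIGASGH HKWTKVSTKG  540
GDEIRFTSRK NINDPGEPLG LIICSVLSTW LPVPAMSLFN FLRDDSRRTE WDIMLTPSPT  600
QTMVNLVKGQ DRGNSVTIYS LQTTTSSERT NIWVLQDCST NSYESMVVFA PVEIDGTQSV  660
MNGCDSSSLA ILPSGFSILP DGLETRPLVI TSRPQERTME GGSLLTVAFQ ILADASPVAR  720
PTTESVETIN TLVSCTLQNI KKALQCEDG
"""


def parse_sequence_block(block: str) -> str:
    """Drop whitespace and position counters, keeping residue letters only."""
    return re.sub(r"[^A-Za-z]", "", block).upper()


def region(seq: str, start: int, end: int) -> str:
    """Return residues start..end, treating both as 1-based and inclusive."""
    return seq[start - 1:end]


protein = parse_sequence_block(SEQUENCE_BLOCK)

# Signature domain coordinates taken from the Signature Domain table.
homeobox = region(protein, 91, 146)
start_domain = region(protein, 265, 484)

# Zero-based offset of the reported NLS motif within the sequence.
nls_offset = protein.find("RKRKK")

print(len(protein))
print(homeobox)
print(start_domain[:10], start_domain[-4:])
print(nls_offset)
```

With these assumptions the parsed length is 749 and the Homeobox slice reproduces the target row of the Homeobox alignment. The motif RKRKK is found at zero-based offset 88, which corresponds to residues 89 to 93 in 1-based numbering. This suggests the NLS Start and End of 88 and 92 may be zero-based offsets, but that is an inference from the data and not something the record states.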
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Protein
Source     Hit ID                 E-value  Description
Refseq     XP_009415234.1         0.0      PREDICTED: homeobox-leucine zipper protein GLABRA 2-like
Swissprot  P46607                 0.0      HGL2_ARATH; Homeobox-leucine zipper protein GLABRA 2
TrEMBL     M0TU37                 0.0      M0TU37_MUSAM; Uncharacterized protein
STRING     GSMUA_Achr8P27340_001  0.0      (Musa acuminata)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
MonocotsOGMP79938147
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT1G79840.1  0.0      HD-ZIP family protein